Improving the Kelly-Lochbaum vocal tract model using conical tube sections and fractional delay filtering techniques
Abstract
An articulatory model of speech production is usually constructed by approximating the profile of the vocal tract using cylindrical tube sections. This is implemented by a digital ladder filter that is called the Kelly–Lochbaum model. In this paper we propose an extended approach, where the tube sections approximating the profile of the tract are conical instead of cylindrical. Furthermore, the length of each tube section in our model can be accurately controlled using a novel fractional delay filtering scheme. These refinements result in an accurate and intuitively controllable vocal tract model that is well suited for articulatory speech synthesis.
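As a rough illustration of the classical cylindrical-tube formulation that the abstract takes as its starting point, the sketch below computes junction reflection coefficients from a cross-sectional area function and applies one scattering junction. It assumes pressure-wave variables and the sign convention r = (A_k - A_{k+1}) / (A_k + A_{k+1}); the function names are hypothetical and not taken from the paper. The fractional delay part uses Lagrange interpolation, a common textbook design, as a stand-in; it is not the paper's own scheme, and the conical-section junctions the paper proposes are not modelled here.

```python
def reflection_coefficients(areas):
    """Reflection coefficient at each junction between adjacent
    cylindrical sections (pressure-wave convention, assumed here)."""
    return [(a0 - a1) / (a0 + a1) for a0, a1 in zip(areas, areas[1:])]


def scatter(r, p_fwd_in, p_bwd_in):
    """One two-port scattering junction for pressure waves.

    p_fwd_in arrives from the glottis side, p_bwd_in from the lip side.
    Returns (wave transmitted toward the lips, wave reflected toward
    the glottis).
    """
    p_fwd_out = (1.0 + r) * p_fwd_in - r * p_bwd_in
    p_bwd_out = r * p_fwd_in + (1.0 - r) * p_bwd_in
    return p_fwd_out, p_bwd_out


def lagrange_fractional_delay(delay, order):
    """FIR coefficients that approximate a delay of `delay` samples
    (possibly non-integer) by Lagrange interpolation of the given order."""
    coeffs = []
    for n in range(order + 1):
        h = 1.0
        for k in range(order + 1):
            if k != n:
                h *= (delay - k) / (n - k)
        coeffs.append(h)
    return coeffs


if __name__ == "__main__":
    areas = [1.0, 1.0, 3.0]  # hypothetical area function, arbitrary units
    print(reflection_coefficients(areas))
    print(lagrange_fractional_delay(1.3, 3))
```

Equal adjacent areas give a reflection coefficient of zero, so the junction passes waves through unchanged. A fractional delay filter of this kind is one way to let the effective length of a tube section differ from a whole number of sampling intervals.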
Similar resources
Articulatory speech synthesis based on fractional delay waveguide filters
An extension to the traditional Kelly-Lochbaum vocal tract model is introduced. In the new model not only the diameter but also the length of each tube section can be continuously adjusted. This is achieved by using fractional delay filter techniques such as interpolation and deinterpolation. The filter structure consisting of bidirectional delay lines (digital waveguides) and interpolated port...
Articulatory Vocal Tract Synthesis in Supercollider
The APEX system [1] enables vocal tract articulation using a reduced set of user-controllable parameters by means of Principal Component Analysis of X-ray tract data. From these articulatory profiles it is then possible to calculate cross-sectional area function data that can be used as input to a number of articulatory-based speech synthesis algorithms. In this paper the Kelly-Lochbaum 1-D dig...
Mixed physical modeling techniques applied to speech production
The Kelly-Lochbaum transmission-line model of the vocal tract started the discrete-time modeling of speech production. More recently, similar techniques have been developed in computer music towards a more generalized methodology. In this paper we will study the application of mixed physical modeling to speech production and speech synthesis. These approaches are Digital Waveguides (DWG), Finite...
Estimation studies of vocal tract shape trajectory using a variable length and lossy Kelly-Lochbaum model
This work demonstrates the use of a modified Kelly-Lochbaum (KL) vocal tract (VT) model in dynamic mapping from speech signals to articulatory configurations. The sixteen-section KL model is equipped with a variable-length segment for lip rounding and an accurate model for lip radiation impedance. Profiles for the eight Finnish vowels are used to form so-called anchor points in the articulatory ...
Articulatory synthesis of formant targeted sounds with parameters derived from the inverse solution of speech production
A new approach to produce high-fidelity speech sounds by applying both the inverse solution of speech production and the pitch-synchronous articulatory synthesis technique is presented. Given a formant trace target, the dynamic vocal-tract area function together with the time-variant VT length are estimated using an inverse solution of speech production. The improved Kelly-Lochbaum filter of the synt...